Abstract

Using a probabilistic approach we study the parallel dynamics of
fully connected Q-Ising neural networks for arbitrary Q. A Lyapunov
function is shown to exist at zero temperature. A recursive
scheme is set up to determine the time evolution of the order
parameters through the evolution of the distribution of the local
field. As an illustrative example, an explicit analysis is carried
out for the first three time steps. For the case of the Q = 3 model
these theoretical results are compared with extensive numerical
simulations. Finally, equilibrium fixed-point equations are derived
and compared with the thermodynamic approach based
upon the replica-symmetric mean-field approximation.
1 Introduction

The parallel dynamics of extremely diluted asymmetric and layered feed- 
forward Q > 2-Ising neural networks have been solved exactly (cfr. and 
the references cited therein). This has been possible because in these types of 
networks one knows that there are no feedback loops as time progresses. In 
particular, this allows one to derive recursion relations for the relevant order 
parameters of these systems: the main overlap for the condensed pattern, the 
mean of the neuron activities and the variance of the residual overlap respon- 
sible for the intrinsic noise in the dynamics of the main overlap (sometimes 
called the width-parameter). 

These results are in strong contrast to those for the parallel dynamics of
networks with symmetric connections. For these systems it turns out that
even in the diluted Q = 2 case, feedback correlations become essential from
the second time step onwards, which already complicates the dynamics in
a nontrivial way. For fully connected Q = 2 systems, the increasing
complexity of such long-term temporal correlations makes the dynamics, in
general, extremely complicated. Therefore, so far either the feedback
influence on the network evolution has been treated approximately, or only
the first few time steps of the main overlap evolution have been analyzed.
Nevertheless, this has led to some important insights into the dynamics of
the Little-Hopfield model (cfr. [7]-[15] and references therein).



In this paper we consider the zero-temperature parallel dynamics of fully
connected Q-Ising neural networks for general Q. Generalizing a Q = 2 result
from the literature [16]-[17] we find that there exists a Lyapunov function
leading to the occurrence of fixed-points and two-cycles. Since two-cycles in
the Q = 2 Little-Hopfield model seem to appear far from the retrieval region
[18] and/or seem to involve only a tiny fraction of all spins [19], we only look
at the fixed-point dynamics.



Using a probabilistic approach |13] we extend our analysis for extremely 



diluted fl,|]2U| and layered Q-Ising networks M to the non-trivial case of 
fully connected systems at zero temperature. In particular, we develop a 
recursive scheme to calculate the relevant order parameters of the system, 
i.e., the main overlap, the activity and the variance of the residual overlap, 
for any time step. We write out these expressions in detail for the first 
three time steps of the dynamics. Furthermore, under the condition that 
the local field becomes stationary we derive the fixed-point equations for 
these order parameters. They are found to be the same as those derived via 
thermodynamical methods [21]. Finally, extensive numerical simulations for
the Q = 3 model are compared with the theoretical results. 

The rest of the paper is organized as follows. In Section 2 we introduce the
model, its dynamics and the Hamming distance as a macroscopic measure for
the retrieval quality. In Section 3 we use the probabilistic approach in order
to derive a recursive scheme for the evolution of the distribution of the local
field, leading to recursion relations for the order parameters of the model.
Using this general scheme, we explicitly calculate in Section 4 the order
parameters for the first three time steps of the dynamics. In Section 5 we
show the existence of a Lyapunov function at zero temperature and we discuss
the evolution of the system to fixed-point attractors. A detailed discussion of
the theoretical results obtained in Section 4 and a comparison with extensive
numerical simulations are presented in Section 6. Some concluding remarks
are given in Section 7.



2 The model 

Consider a neural network $\Lambda$ consisting of $N$ neurons which can take
values $\sigma_i$ from a discrete set $\mathcal{S} = \{-1 = s_1 < s_2 < \ldots < s_Q = +1\}$. Given
the configuration $\sigma_\Lambda(t) \equiv \{\sigma_j(t)\}$, $j \in \Lambda = \{1,\ldots,N\}$, the local field in
neuron $i$ equals

$$h_i(\sigma_{\Lambda\setminus\{i\}}(t)) = \sum_{j\in\Lambda\setminus\{i\}} J_{ij}\,\sigma_j(t) \tag{1}$$

with $J_{ij}$ the synaptic couplings between neurons $i$ and $j$. In the sequel
we write the shorthand notation $h_{\Lambda,i}(t) \equiv h_i(\sigma_{\Lambda\setminus\{i\}}(t))$. The configuration
$\sigma_\Lambda(t=0)$ is chosen as input. At zero temperature all neurons are updated
in parallel according to the rule

$$\sigma_i(t) \to \sigma_i(t+1) = s_k : \quad \min_{s\in\mathcal{S}} \epsilon_i[s|\sigma_{\Lambda\setminus\{i\}}(t)] = \epsilon_i[s_k|\sigma_{\Lambda\setminus\{i\}}(t)] . \tag{2}$$

Here the energy potential $\epsilon_i[s|\sigma_{\Lambda\setminus\{i\}}]$ is defined by

$$\epsilon_i[s|\sigma_{\Lambda\setminus\{i\}}] = -\frac{1}{2}\left[h_i(\sigma_{\Lambda\setminus\{i\}})\,s - b s^2\right] , \tag{3}$$

where $b > 0$ is the gain parameter of the system. The updating rule (2) is
equivalent to using a gain function $g_b(\cdot)$,

$$\sigma_i(t+1) = g_b(h_{\Lambda,i}(t)), \qquad g_b(x) \equiv \sum_{k=1}^{Q} s_k \left[\theta[b(s_{k+1}+s_k) - x] - \theta[b(s_k + s_{k-1}) - x]\right] \tag{4}$$

with $s_0 \equiv -\infty$ and $s_{Q+1} \equiv +\infty$. For finite $Q$, this gain function $g_b(\cdot)$ is a
step function. The gain parameter $b$ controls the average slope of $g_b(\cdot)$.
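As an illustration of eq. (4), the following minimal sketch implements the step gain function and checks it against a direct minimisation of the energy potential (3). It assumes equidistant states (as used later for the Q = 3 simulations); the helper names `make_states`, `gain` and `gain_by_energy` are hypothetical and not taken from the paper. The positive prefactor of the energy does not affect the minimiser.

```python
def make_states(q):
    """Equidistant states s_1 = -1 < ... < s_Q = +1 (an illustrative assumption)."""
    return [-1.0 + 2.0 * k / (q - 1) for k in range(q)]


def gain(x, b, states):
    """Step gain function g_b(x) of eq. (4).

    Returns s_k when b*(s_k + s_{k-1}) < x < b*(s_k + s_{k+1}),
    with the conventions s_0 = -infinity and s_{Q+1} = +infinity.
    """
    for k in range(len(states) - 1):
        if x < b * (states[k] + states[k + 1]):
            return states[k]
    return states[-1]


def gain_by_energy(x, b, states):
    """Same map, obtained by minimising eps[s] = -(x*s - b*s^2)/2 over the states."""
    return min(states, key=lambda s: -0.5 * (x * s - b * s * s))
```

Away from the thresholds $b(s_k + s_{k+1})$, where the minimum is degenerate, both functions return the same state.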

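In this network we want to store $p = \alpha N$ patterns. These patterns
are a collection of independent and identically distributed random variables
(i.i.d.r.v.), $\{\xi_i^\mu \in \mathcal{S}\}$, $\mu \in \mathcal{P} = \{1,\ldots,p\}$ and $i \in \Lambda$, with zero mean and
variance $A = \mathrm{Var}[\xi_i^\mu]$. The synaptic couplings between the neurons are chosen
according to the Hebb learning rule

$$J_{ij} = \frac{1}{NA}\sum_{\mu\in\mathcal{P}} \xi_i^\mu \xi_j^\mu \quad \text{for } i \neq j, \qquad J_{ii} = 0 . \tag{5}$$

To measure the retrieval quality of the system one can use the Hamming
distance between a stored pattern and the microscopic state of the network

$$d(\xi^\mu, \sigma_\Lambda(t)) \equiv \frac{1}{N}\sum_{i\in\Lambda} [\xi_i^\mu - \sigma_i(t)]^2 . \tag{6}$$

This naturally introduces the main overlap

$$m_\Lambda^\mu(t) = \frac{1}{NA}\sum_{i\in\Lambda} \xi_i^\mu \sigma_i(t), \qquad \mu \in \mathcal{P} \tag{7}$$

and the arithmetic mean of the neuron activities

$$a_\Lambda(t) = \frac{1}{N}\sum_{i\in\Lambda} [\sigma_i(t)]^2 . \tag{8}$$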
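The definitions (5)-(8) translate directly into code. The sketch below is illustrative only: it assumes a squared-difference Hamming distance, stores patterns as plain Python lists, and uses hypothetical function names. Expanding the square in the Hamming distance shows that it is fixed by the pattern activity, the main overlap and the network activity, which the final check exploits.

```python
import random


def hebb_couplings(patterns, A):
    """Hebb rule (5): J_ij = (1/(N*A)) * sum_mu xi_i^mu xi_j^mu for i != j, J_ii = 0."""
    n = len(patterns[0])
    J = [[0.0] * n for _ in range(n)]
    for xi in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    J[i][j] += xi[i] * xi[j] / (n * A)
    return J


def overlap(xi, sigma, A):
    """Main overlap (7) between a pattern xi and a configuration sigma."""
    return sum(x * s for x, s in zip(xi, sigma)) / (len(xi) * A)


def activity(sigma):
    """Arithmetic mean of the squared neuron states, eq. (8)."""
    return sum(s * s for s in sigma) / len(sigma)


def hamming(xi, sigma):
    """Hamming distance (6), taken here as the mean squared difference."""
    return sum((x - s) ** 2 for x, s in zip(xi, sigma)) / len(xi)
```

For any pattern and configuration, `hamming(xi, sigma)` equals `activity(xi) - 2*A*overlap(xi, sigma, A) + activity(sigma)`, so a vanishing Hamming distance requires both the overlap and the activity to match the pattern.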

3 General dynamical scheme 

It is known that, contrary to the asymmetrically diluted and layered neural
networks, the parallel dynamics of fully connected systems, even at zero
temperature, is not exactly solvable because of the strong feedback
correlations [22].

On the basis of the probabilistic approach used before (see, e.g., p}| , fll3| ) 
we develop in this section a recursive dynamical scheme in order to calculate 
the distribution of the local field at a general time step, for Q > 2-Ising neural 
networks. This results in recursion relations determining the evolution of the
order parameters of these systems. 

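Suppose that the initial configuration of the network $\{\sigma_i(0)\}$, $i \in \Lambda$, is a
collection of i.i.d.r.v. with mean $\mathrm{E}[\sigma_i(0)] = 0$, variance $\mathrm{Var}[\sigma_i(0)] = a_0$, and
correlated with only one stored pattern, say the first one:

$$\mathrm{E}[\xi_i^\mu \sigma_i(0)] = \delta_{\mu,1}\, m_0^1 A, \qquad m_0^1 > 0 . \tag{9}$$

This implies that by the law of large numbers (LLN) one gets for the main
overlap and the activity at $t = 0$

$$m^1(0) \equiv \lim_{N\to\infty} m_\Lambda^1(0) \overset{Pr}{=} \frac{1}{A}\,\mathrm{E}[\xi_i^1 \sigma_i(0)] = m_0^1 \tag{10}$$

$$a(0) \equiv \lim_{N\to\infty} a_\Lambda(0) \overset{Pr}{=} \mathrm{E}[\sigma_i^2(0)] = a_0 \tag{11}$$

where the convergence is in probability [23]. Using standard signal-to-noise
techniques (see, e.g., [20]), we find the local field at $t = 0$

$$h_i(0) \equiv \lim_{N\to\infty}\left[\xi_i^1 m^1_{\Lambda\setminus\{i\}}(0) + \frac{1}{NA}\sum_{\mu\in\mathcal{P}\setminus\{1\}}\ \sum_{j\in\Lambda\setminus\{i\}} \xi_i^\mu \xi_j^\mu \sigma_j(0)\right] \overset{\mathcal{D}}{=} \xi_i^1 m^1(0) + \mathcal{N}(0, \alpha a_0) \tag{12}$$

where the convergence is in distribution (see, e.g., [23]). The quantity $\mathcal{N}(0, d)$
represents a Gaussian random variable with mean 0 and variance $d$.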

For a general time step we find from eq. (4) and the LLN in the limit
$N \to \infty$ for the main overlap (7) and the activity (8)

$$m^1(t+1) \overset{Pr}{=} \frac{1}{A}\,\langle\!\langle \xi_i^1\, g_b(h_i(t)) \rangle\!\rangle \tag{13}$$

$$a(t+1) \overset{Pr}{=} \langle\!\langle g_b^2(h_i(t)) \rangle\!\rangle \tag{14}$$

with $h_i(t) \equiv \lim_{N\to\infty} h_{\Lambda,i}(t)$. In the above $\langle\!\langle \cdot \rangle\!\rangle$ denotes the average both over
the distribution of the embedded patterns $\{\xi_i^\mu\}$ and the initial configurations
$\{\sigma_i(0)\}$. The average over the initial configurations is hidden in an average
over the local field through the updating rule (4). From the study of layered
networks (0-0]) we know already that due to the correlations there will be 
a third important parameter in the description of the time evolution of the 
system: the influence of the non-condensed patterns which is expressed by
the variance of the residual overlaps

$$D(t) \equiv \mathrm{Var}[r^\mu(t)] \tag{15}$$

$$r^\mu(t) \equiv \lim_{N\to\infty} r_\Lambda^\mu(t) = \lim_{N\to\infty} \frac{1}{A\sqrt{N}} \sum_{i\in\Lambda} \xi_i^\mu \sigma_i(t), \qquad \mu \in \mathcal{P}\setminus\{1\} . \tag{16}$$

Clearly, $D(0) = a_0/A$. The distribution of the embedded patterns is given.
It is the purpose of this section to calculate the distribution of the local field
as a function of time.

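We start by rewriting the local field (1) at time $t$ in the following way

$$h_{\Lambda,i}(t) = \frac{1}{NA}\sum_{\mu\in\mathcal{P}}\ \sum_{j\in\Lambda} \xi_i^\mu \xi_j^\mu \sigma_j(t) - \alpha\,\sigma_i(t) \tag{17}$$

$$\phantom{h_{\Lambda,i}(t)} = \xi_i^1 m_\Lambda^1(t) - \alpha\,\sigma_i(t) + \frac{1}{\sqrt{N}} \sum_{\mu\in\mathcal{P}\setminus\{1\}} \xi_i^\mu r_\Lambda^\mu(t) . \tag{18}$$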

From a technical point of view the explicit addition and subtraction of the
$\alpha\sigma_i(t)$ term in (17) is convenient in order to treat all indices in the sum over
$j$ on an equal footing. This turns out to be important to take into account
all possible feedback loops.

We would like to remark that the set of $\alpha N$ variables $\{\xi_i^\mu r_\Lambda^\mu(t)\}_\mu$ appearing
in the last term of (18) are not independent because the $\xi_i^\mu$ are weakly
dependent on the $r_\Lambda^\nu(t)$, $\nu \neq \mu$. On the contrary, in the case of layered or
diluted networks all terms of this set of variables are independent such that
their sum is a normal distribution. Moreover, $r_\Lambda^\mu(t)$ and $\xi_i^\mu$ are also
independent, implying that the mean and the variance of this distribution are
known directly. The same is true for the fully connected model at time $t = 0$
where the network states $\{\sigma_i(0)\}_i$ are randomly distributed and independent
of the non-condensed ($\mu > 1$) embedded patterns. But after applying the
dynamics the $\sigma_i(t)$ and the $\xi_i^\mu$ become dependent, leading to a weak dependence
of $r^\mu_{\Lambda\setminus\{i\}}(t)$ and $\xi_i^\mu$. This microscopic dependence gives rise to a macroscopic
contribution after summing and taking the limit $N \to \infty$. In this respect
we mention that in ref. || an approximation has been put forward by
neglecting precisely these correlations between the $\xi_i^\mu$ and the $r^\mu_{\Lambda\setminus\{i\}}(t)$. For an
overview of improvements of this approximation for Q = 2 and corresponding
numerical simulations we refer to [15].



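In order to determine the structure of the local field for fully connected
networks, we first concentrate on the evolution of the residual overlap $r_\Lambda^\mu(t)$,
$\mu \in \mathcal{P}\setminus\{1\}$. As mentioned above the dynamics induces the dependence of $\{\sigma_i(t)\}$
and $\{\xi_i^\mu\}$. To study its consequences we rewrite the residual overlap (16) as

$$r_\Lambda^\mu(t+1) = \frac{1}{A\sqrt{N}} \sum_{i\in\Lambda} \xi_i^\mu\, g_b\!\left(\hat{h}^\mu_{\Lambda,i}(t) + \frac{1}{\sqrt{N}}\,\xi_i^\mu r_\Lambda^\mu(t)\right) \tag{19}$$

with

$$\hat{h}^\mu_{\Lambda,i}(t) = h_{\Lambda,i}(t) - \frac{1}{\sqrt{N}}\,\xi_i^\mu r_\Lambda^\mu(t) . \tag{20}$$

By subtracting the term $\xi_i^\mu r_\Lambda^\mu(t)/\sqrt{N}$ the modified local field $\hat{h}^\mu_{\Lambda,i}(t)$ becomes
only weakly dependent on $\xi_i^\mu$, whereas $h_{\Lambda,i}(t)$ depends strongly on $\xi_i^\mu$. The
gain function $g_b(\cdot)$ is a step function that changes its value by $s_{k+1} - s_k$ at
$b(s_k + s_{k+1})$, $k = 1, \ldots, Q-1$. Hence the term $\xi_i^\mu r_\Lambda^\mu(t)/\sqrt{N}$ in (19) becomes
relevant if for some $k$

$$\left|\hat{h}^\mu_{\Lambda,i}(t) - b(s_k + s_{k+1})\right| < \frac{|\xi_i^\mu r_\Lambda^\mu(t)|}{\sqrt{N}}, \qquad k = 1, \ldots, Q-1 . \tag{21}$$

Denoting by $I_k$ the set of indices satisfying condition (21), we split the
residual overlap into two sums:

$$r_\Lambda^\mu(t+1) = \frac{1}{A\sqrt{N}} \sum_{i\notin\cup_k I_k} \xi_i^\mu\, g_b(\hat{h}^\mu_{\Lambda,i}(t)) + \frac{1}{A\sqrt{N}} \sum_{k=1}^{Q-1} \sum_{i\in I_k} \xi_i^\mu \left[ g_b(\hat{h}^\mu_{\Lambda,i}(t)) + \frac{s_{k+1}-s_k}{2}\left( \mathrm{sign}\big(b(s_k + s_{k+1}) - \hat{h}^\mu_{\Lambda,i}(t)\big) + \mathrm{sign}\Big(\frac{1}{\sqrt{N}}\,\xi_i^\mu r_\Lambda^\mu(t)\Big) \right) \right] . \tag{22}$$

In the argument of $g_b(\cdot)$ in the first term of (22) the term $\xi_i^\mu r_\Lambda^\mu(t)/\sqrt{N}$ is
left out since it cannot change the value of $g_b(\cdot)$ by definition of the sets $I_k$.
Combining the first two terms, eq. (22) can be rewritten as

$$r_\Lambda^\mu(t+1) = \frac{1}{A\sqrt{N}} \sum_{i\in\Lambda} \xi_i^\mu\, g_b(\hat{h}^\mu_{\Lambda,i}(t)) + \frac{1}{A\sqrt{N}} \sum_{k=1}^{Q-1} \sum_{i\in I_k} \xi_i^\mu\, \frac{s_{k+1}-s_k}{2} \left[ \mathrm{sign}\big(b(s_k + s_{k+1}) - \hat{h}^\mu_{\Lambda,i}(t)\big) + \mathrm{sign}\Big(\frac{1}{\sqrt{N}}\,\xi_i^\mu r_\Lambda^\mu(t)\Big) \right] . \tag{23}$$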
We then consider the limit $N \to \infty$. In this limit the cardinal number of the
set $I_k$ becomes deterministic

$$\lim_{N\to\infty} \frac{|I_k|}{\sqrt{N}} = 2\,|\xi_i^\mu r^\mu(t)|\; f_{\hat{h}_i^\mu(t)}\big(b(s_{k+1} + s_k)\big) \tag{24}$$

with $f_{\hat{h}_i^\mu(t)}$ the probability density of the modified local field $\hat{h}_i^\mu(t)$ at time $t$.
We remark that in the thermodynamic limit the density distribution of the
modified local field $\hat{h}_i^\mu(t)$ at time $t$ equals the density distribution of the local
field $h_i(t)$ itself.

Furthermore, we apply the CLT on the first term of (23) and the LLN
on the second term with the random variable $r^\mu(t)$ fixed. This yields the
following result

$$r^\mu(t+1) = \tilde{r}^\mu(t) + \chi(t)\, r^\mu(t) \tag{25}$$

where, recalling eqs. (14) and (20),

$$\tilde{r}^\mu(t) \equiv \lim_{N\to\infty} \frac{1}{A\sqrt{N}} \sum_{i\in\Lambda} \xi_i^\mu\, g_b(\hat{h}^\mu_{\Lambda,i}(t)) \overset{\mathcal{D}}{=} \mathcal{N}(0, a(t+1)/A) \tag{26}$$

because of the weak dependence of $\hat{h}_i^\mu(t)$ and $\xi_i^\mu$, and

$$\chi(t) = \sum_{k=1}^{Q-1} f_{\hat{h}_i^\mu(t)}\big(b(s_{k+1} + s_k)\big)\,(s_{k+1} - s_k) . \tag{27}$$

From the relation (25) one finds a recursion relation for the variance of
the residual overlap

$$D(t+1) = \frac{a(t+1)}{A} + \chi^2(t) D(t) + 2\chi(t)\,\mathrm{Cov}[\tilde{r}^\mu(t), r^\mu(t)] . \tag{28}$$

At this point it is interesting to remark that the last term on the r.h.s. of
(28) is entirely coming from the correlations caused by the fully connected
structure of the network. It is absent for layered (compare eq. (30c) of ref. M) 
and hence, of course, also for extremely diluted asymmetric architectures. In 
the latter case also the second term on the r.h.s. of (28) disappears.
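The three architectures discussed above differ only in which terms of the recursion (28) survive. A minimal sketch, with a hypothetical function name and the covariance passed in as a number that must be computed separately:

```python
def next_residual_variance(a_next, A, chi, D, cov):
    """One step of recursion (28): D(t+1) = a(t+1)/A + chi(t)^2 D(t) + 2 chi(t) Cov.

    a_next : activity a(t+1)
    A      : pattern variance
    chi    : chi(t) from eq. (27)
    D      : D(t)
    cov    : Cov[r~(t), r(t)]; zero for layered architectures
    """
    return a_next / A + chi ** 2 * D + 2.0 * chi * cov
```

Setting `cov=0.0` gives the layered case described in the text, and setting `chi=0.0` as well drops the second term, as stated for extremely diluted asymmetric architectures.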

Starting from the local field at time $t+1$ in the form of eq. (18) and using
expressions (25), (26) and (^) we obtain in the limit $N \to \infty$, after some
straightforward manipulations

$$h_i(t+1) = \xi_i^1 m^1(t+1) + \chi(t)\left[h_i(t) - \xi_i^1 m^1(t) + \alpha\sigma_i(t)\right] + \mathcal{N}(0, \alpha a(t+1)) . \tag{29}$$

From this it is clear that the local field at time $t$ consists of a discrete
part and a normally distributed part, viz.

$$h_i(t) = M_i(t) + \mathcal{N}(0, V(t)) \tag{30}$$

where $M_i(t)$ satisfies the recursion relation

$$M_i(t+1) = \chi(t)\left[M_i(t) - \xi_i^1 m^1(t) + \alpha\sigma_i(t)\right] + \xi_i^1 m^1(t+1) \tag{31}$$

and

$$V(t+1) = \alpha A D(t+1) \tag{32}$$

with $D(t+1)$ given by the recursion relation (28).

We still have to determine $f_{h_i(t)}$ in eq. (27). We know that the quantity
$M_i(t)$ consists of the signal term and a discrete noise term, viz.

$$M_i(t) = \xi_i^1 m^1(t) + \alpha \sum_{t'=0}^{t-1} \left[\prod_{s=t'}^{t-1} \chi(s)\right] \sigma_i(t') . \tag{33}$$

The evolution equation tells us that $\sigma_i(t')$ can be replaced by $g_b(h_i(t'-1))$
such that the second term of $M_i(t)$ is the sum of step functions of correlated
variables. These are also correlated through the dynamics with the normally
distributed part of $h_i(t)$. Therefore the local field can be considered as a
transformation of a set of correlated normally distributed variables $x_s$,
$s = 0, \ldots, t-2, t$. Defining the correlation matrix $C_{ss'} = \mathrm{E}[x_s x_{s'}]$ we arrive at
the following expression for the probability density of the local field at time $t$

$$f_{h_i(t)}(y) = \int \prod_{s=0}^{t-2} dx_s\, dx_t\; \delta\!\left(y - M_i(t) - \sqrt{V(t)}\, x_t\right) \frac{1}{\sqrt{\det(2\pi C)}} \exp\!\left(-\frac{1}{2}\, \mathbf{x}\, C^{-1} \mathbf{x}^T\right) \tag{34}$$

with $\mathbf{x} = (x_0, \ldots, x_{t-2}, x_t)$.

Together with the eqs. (13)-(14) for $m^1(t+1)$ and $a(t+1)$ the equations
(25)-(28), (31) and (34) form a recursive scheme in order to obtain the order
parameters of the system. The practical difficulty which remains is the
explicit calculation of the correlations in the network at different time steps as
present in eq. (34).
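The closed form (33) for the discrete part of the local field follows from iterating the recursion (31). The sketch below checks this numerically for one neuron; it assumes the starting value $M_i(0) = \xi_i^1 m^1(0)$ read off from eq. (12), and the function names and the list-based time series are hypothetical conveniences.

```python
def discrete_part_recursive(xi, m, chi, sigma, alpha):
    """Iterate recursion (31) starting from M(0) = xi * m(0).

    m     : [m(0), ..., m(t)]        (length t + 1)
    chi   : [chi(0), ..., chi(t-1)]  (length t)
    sigma : [sigma(0), ..., sigma(t-1)], the states of this neuron
    """
    M = xi * m[0]
    for t in range(len(chi)):
        M = chi[t] * (M - xi * m[t] + alpha * sigma[t]) + xi * m[t + 1]
    return M


def discrete_part_closed(xi, m, chi, sigma, alpha):
    """Closed form (33): signal term plus weighted sum over earlier states."""
    t = len(chi)
    noise = 0.0
    for tp in range(t):
        weight = 1.0
        for s in range(tp, t):  # product of chi(s) for s = t', ..., t-1
            weight *= chi[s]
        noise += weight * sigma[tp]
    return xi * m[t] + alpha * noise
```

Both functions return the same value for any input series, which makes explicit that every earlier state $\sigma_i(t')$ feeds back into the field at time $t$ with weight $\alpha \prod_{s=t'}^{t-1}\chi(s)$.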
4 Evolution equations up to the third time step

Following the general recursive scheme established in Section 3, evolution
equations are derived for the order parameters of a fully connected Q-Ising 
network for the first three time steps, taking into account all correlations. 
This generalizes and extends the Q = 2 results in the literature mentioned 
in the Introduction. 

4.1 First step dynamics 

Starting from eqs. ([Tj),(0) and (IT3) one has immediately 

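$$m^1(1) = \frac{1}{A}\,\Big\langle\!\!\Big\langle \xi^1 \int \mathcal{D}z\; g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, z\right) \Big\rangle\!\!\Big\rangle \tag{35}$$

$$a(1) = \Big\langle\!\!\Big\langle \int \mathcal{D}z\; g_b^2\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, z\right) \Big\rangle\!\!\Big\rangle , \tag{36}$$

where $\langle\!\langle \cdots \rangle\!\rangle$ now stands for the average taken with respect to the distribution
of the first pattern and the initial configuration and $\mathcal{D}z$ denotes a Gaussian
measure $\mathcal{D}z = dz \exp(-\tfrac{1}{2} z^2)/\sqrt{2\pi}$. We recall that $D(0) = a_0/A$. Next,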
from the initial conditions ©-(0) and the definition of the modified local 
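field (20) one also knows that $\{\xi_i^\mu, g_b(\hat{h}_i^\mu(0))\}_i$ and $\{\xi_i^\mu, \sigma_i(0)\}_i$ become a set
of uncorrelated parameters for $\mu \in \mathcal{P}\setminus\{1\}$. Here $\hat{h}_i^\mu(t) \equiv \lim_{N\to\infty} \hat{h}^\mu_{\Lambda,i}(t)$.
Therefore

$$\mathrm{Cov}[\tilde{r}^\mu(0), r^\mu(0)] = \frac{1}{A}\,\mathrm{E}[\sigma_i(0)\, g_b(\hat{h}_i^\mu(0))] . \tag{37}$$

Using the recursion relation (28) this leads to

$$D(1) = \frac{a(1)}{A} + \chi^2(0) D(0) + 2\,\frac{\chi(0)}{A}\,\Big\langle\!\!\Big\langle \sigma(0) \int \mathcal{D}z\; g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, z\right) \Big\rangle\!\!\Big\rangle \tag{38}$$

with

$$\chi(0) = \Big\langle\!\!\Big\langle \frac{1}{\sqrt{\alpha A D(0)}} \int \mathcal{D}z\; z\, g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, z\right) \Big\rangle\!\!\Big\rangle . \tag{39}$$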

These results generalize the corresponding Q = 2 results (see, e.g., and 
PI)- 
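For a concrete feel of the first-step equations (35), (36) and (39), the following sketch evaluates them in closed form for the Q = 3 model with states $\{-1, 0, +1\}$ and uniformly distributed patterns ($A = 2/3$), the case used in the numerical section. The closed form relies on the gain function having its two thresholds at $\pm b$, so the Gaussian integrals reduce to error functions; $\chi(0)$ is obtained from the threshold densities as in eq. (27) and cross-checked against a plain Riemann sum of eq. (39). All function names are hypothetical.

```python
import math


def Phi(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def phi(x):
    """Standard normal probability density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def first_step_q3(m0, a0, alpha, b):
    """Return (m(1), a(1), chi(0)) for states {-1,0,+1}, uniform patterns, A = 2/3."""
    A = 2.0 / 3.0
    s = math.sqrt(alpha * a0)           # sqrt(alpha*A*D(0)) with D(0) = a0/A
    m1 = a1 = chi0 = 0.0
    for xi in (-1.0, 0.0, 1.0):         # each pattern value has probability 1/3
        mu = xi * m0
        up = Phi((mu - b) / s)          # P(h > b):  g_b = +1
        down = Phi(-(mu + b) / s)       # P(h < -b): g_b = -1
        m1 += xi * (up - down) / (3.0 * A)
        a1 += (up + down) / 3.0
        # field density at the two thresholds, each with unit step height
        chi0 += (phi((b - mu) / s) + phi((b + mu) / s)) / (3.0 * s)
    return m1, a1, chi0


def chi0_by_quadrature(m0, a0, alpha, b, n=20001, zmax=8.0):
    """Eq. (39) evaluated with a plain Riemann sum over z (slow but direct)."""
    s = math.sqrt(alpha * a0)
    dz = 2.0 * zmax / (n - 1)
    total = 0.0
    for xi in (-1.0, 0.0, 1.0):
        for k in range(n):
            z = -zmax + k * dz
            h = xi * m0 + s * z
            g = 1.0 if h > b else (-1.0 if h < -b else 0.0)
            total += z * g * phi(z) * dz / 3.0
    return total / s
```

The agreement of the two routes to $\chi(0)$ reflects Gaussian integration by parts: the average of $z\,g_b$ over the Gaussian measure picks up the field density at each jump of the step function.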
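4.2 Second step dynamics

First we need the distribution of the local field at time $t = 1$. This follows
immediately from eqs. (31) and (32)

$$h_i(1) = \xi_i^1 m^1(1) + \chi(0)\left[h_i(0) - \xi_i^1 m^1(0) + \alpha\sigma_i(0)\right] + \mathcal{N}(0, \alpha a(1)) . \tag{40}$$

Recalling again eqs. (13) and (14), the main overlap and the activity read

$$m^1(2) = \frac{1}{A}\,\Big\langle\!\!\Big\langle \xi^1 \int \mathcal{D}z\; g_b\!\left(\xi^1 m^1(1) + \alpha\chi(0)\sigma(0) + \sqrt{\alpha A D(1)}\, z\right) \Big\rangle\!\!\Big\rangle \tag{41}$$

$$a(2) = \Big\langle\!\!\Big\langle \int \mathcal{D}z\; g_b^2\!\left(\xi^1 m^1(1) + \alpha\chi(0)\sigma(0) + \sqrt{\alpha A D(1)}\, z\right) \Big\rangle\!\!\Big\rangle \tag{42}$$

and

$$\chi(1) = \frac{1}{\sqrt{\alpha A D(1)}}\,\Big\langle\!\!\Big\langle \int \mathcal{D}z\; z\, g_b\!\left(\xi^1 m^1(1) + \alpha\chi(0)\sigma(0) + \sqrt{\alpha A D(1)}\, z\right) \Big\rangle\!\!\Big\rangle . \tag{43}$$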
These equations correspond to the equations for the Q = 2-network found 
in M. The calculation of the third order parameter, i.e., the variance of the 
residual overlap, needs some more work. From the recursion formula (25) 
one finds 

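$$\mathrm{Cov}[\tilde{r}^\mu(1), r^\mu(1)] = \mathrm{Cov}[\tilde{r}^\mu(1), \tilde{r}^\mu(0)] + \chi(0)\,\mathrm{Cov}[\tilde{r}^\mu(1), r^\mu(0)] \tag{44}$$

$$\phantom{\mathrm{Cov}[\tilde{r}^\mu(1), r^\mu(1)]} = \frac{1}{A} R(2,1) + \chi(0)\,\frac{1}{A} R(2,0) \tag{45}$$

with the correlation parameters, $R(t,\tilde{t})$, defined as

$$R(t,\tilde{t}) \equiv \langle\!\langle g_b(\hat{h}^\mu(t-1))\, g_b(\hat{h}^\mu(\tilde{t}-1)) \rangle\!\rangle , \qquad t, \tilde{t} \geq 1 \tag{46}$$

$$R(t,0) \equiv \langle\!\langle \sigma(0)\, g_b(\hat{h}^\mu(t-1)) \rangle\!\rangle , \qquad t \geq 1 . \tag{47}$$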



This is based on the fact that by definition of the modified local field (20)
{£f j Sb(^f an d {£f> become a set of uncorrelated variables. These 

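results lead to the recursion relation (recall eq. (28))

$$D(2) = \frac{a(2)}{A} + \chi^2(1) D(1) + 2\,\frac{\chi(1)}{A}\left(R(2,1) + \chi(0) R(2,0)\right) . \tag{48}$$

We still have to determine the $R(t,\tilde{t})$. The correlation $R(2,0)$ can be written
down immediately again by using the definition of the modified local field at
$t = 1$

$$R(2,0) = \Big\langle\!\!\Big\langle \sigma(0) \int \mathcal{D}z\; g_b\!\left(\xi^1 m^1(1) + \alpha\chi(0)\sigma(0) + \sqrt{\alpha A D(1)}\, z\right) \Big\rangle\!\!\Big\rangle . \tag{49}$$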
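To obtain $R(2,1)$, one remarks that due to the dependence of $\sigma_i(0)$ and $\sigma_i(1)$
the local fields $h_i(1)$ and $h_i(0)$ are correlated. The correlation coefficient of
their normally distributed part, in general defined as

$$\rho(t,\tilde{t}) \equiv \frac{\mathrm{E}\left[(h_i(t) - M_i(t))\,(h_i(\tilde{t}) - M_i(\tilde{t}))\right]}{\sqrt{V(t)}\,\sqrt{V(\tilde{t})}} , \tag{50}$$

is found using the recursion formula (40)

$$\rho(1,0) = \frac{\alpha R(1,0) + \alpha A \chi(0) D(0)}{\sqrt{\alpha A D(0)}\,\sqrt{\alpha A D(1)}} . \tag{51}$$

Employing all this in eq. (46) we arrive at

$$R(2,1) = \Big\langle\!\!\Big\langle \int \mathcal{D}w^{1,0}(x,y)\; g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, x\right) \times g_b\!\left(\xi^1 m^1(1) + \alpha\chi(0)\sigma(0) + \sqrt{\alpha A D(1)}\, y\right) \Big\rangle\!\!\Big\rangle . \tag{52}$$

Here the joint distribution $\mathcal{D}w^{1,0}(x,y)$ equals

$$\mathcal{D}w^{1,0}(x,y) = \frac{dx\, dy}{2\pi\sqrt{1 - \rho(1,0)^2}} \exp\!\left(-\frac{x^2 - 2\rho(1,0)\,xy + y^2}{2\,(1 - \rho(1,0)^2)}\right) . \tag{53}$$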



We remark that, for Q = 2, the result (48) is slightly different from
the corresponding result in [14] (see, e.g., their eq. (39)). In more detail, in
their approach these authors make an ansatz stating the independence of
the normally distributed and discrete part in the noise arising from $t \geq 2$
onwards. They explicitly state that they have no convincing arguments in
favour of (as well as against) this ansatz for $t \geq 2$. In our approach we do not
need this ansatz. This results in a more complicated expression for $R(2,1)$
than the corresponding one found in [14], indicating that this ansatz really
ignores some correlations. In fact both expressions coincide if we put $\rho(1,0)$
equal to zero.

4.3 Third step dynamics 

We start by writing down the distribution of the local field at time $t = 2$.
From eqs. (31) and (32) we find

$$h_i(2) = \xi_i^1 m^1(2) + \alpha\chi(1)\left[\sigma_i(1) + \chi(0)\sigma_i(0)\right] + \mathcal{N}(0, \alpha A D(2)) . \tag{54}$$

This gives for the main overlap

$$m^1(3) = \frac{1}{A}\,\Big\langle\!\!\Big\langle \xi^1 g_b\!\left(\xi^1 m^1(2) + \alpha\chi(1)\left[\sigma(1) + \chi(0)\sigma(0)\right] + \sqrt{\alpha A D(2)}\, y\right) \Big\rangle\!\!\Big\rangle \tag{55}$$

with $y$ the Gaussian random variable $\mathcal{N}(0,1)$. The average has to be taken
over $y$, $\sigma_i(0)$ and $\sigma_i(1)$. The average over $\sigma_i(0)$ causes no difficulties
because this initial configuration is chosen randomly. The average over $y$, the
Gaussian random variable appearing in $h_i(2)$, and $\sigma_i(1)$ is more tricky
because $h_i(2)$ and $\sigma_i(1)$ are correlated by the dynamics. However, the evolution
equation (4) tells us that $\sigma_i(1)$ can be replaced by $g_b(h_i(0))$ and, hence, the
average taken over $h_i(0)$ instead of $\sigma_i(1)$.

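From the recursion relation (29) one finds for the correlation coefficient
between $h_i(0)$ and $h_i(2)$

$$\rho(2,0) = \frac{\alpha\left[R(2,0) + \chi(1)\left[R(1,0) + \chi(0)\, a_0\right]\right]}{\sqrt{\alpha A D(0)}\,\sqrt{\alpha A D(2)}} . \tag{56}$$

Using all this the main overlap at the third time step (55) becomes

$$m^1(3) = \frac{1}{A}\,\Big\langle\!\!\Big\langle \xi^1 \int \mathcal{D}w^{2,0}(x,y)\; g_b\!\left(\xi^1 m^1(2) + \alpha\chi(1)\left[g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, x\right) + \chi(0)\sigma(0)\right] + \sqrt{\alpha A D(2)}\, y\right) \Big\rangle\!\!\Big\rangle \tag{57}$$

where the joint distribution of $x$ and $y$ equals

$$\mathcal{D}w^{2,0}(x,y) = \frac{dx\, dy}{2\pi\sqrt{1 - \rho(2,0)^2}} \exp\!\left(-\frac{x^2 - 2\rho(2,0)\,xy + y^2}{2\,(1 - \rho(2,0)^2)}\right) . \tag{58}$$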



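In an analogous way one arrives at the expression for the activity at the third
time step

$$a(3) = \Big\langle\!\!\Big\langle \int \mathcal{D}w^{2,0}(x,y)\; g_b^2\!\left(\xi^1 m^1(2) + \alpha\chi(1)\left[g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, x\right) + \chi(0)\sigma(0)\right] + \sqrt{\alpha A D(2)}\, y\right) \Big\rangle\!\!\Big\rangle . \tag{59}$$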
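In order to find the variance of the residual overlap at the third time step,
$D(3)$, we start by rewriting eq. (28) as

$$D(3) = \frac{a(3)}{A} + \chi^2(2) D(2) + 2\,\frac{\chi(2)}{A}\left(R(3,2) + \chi(1)\left(R(3,1) + \chi(0) R(3,0)\right)\right) \tag{60}$$

with

$$\chi(2) = \frac{1}{\sqrt{\alpha A D(2)\,(1 - \rho(2,0)^2)}}\,\Big\langle\!\!\Big\langle \int \mathcal{D}y \int \mathcal{D}z\; z\, g_b\Big(\xi^1 m^1(2) + \alpha\chi(1)\left[g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, y\right) + \chi(0)\sigma(0)\right] + \sqrt{\alpha A D(2)\,(1 - \rho(2,0)^2)}\; z + \sqrt{\alpha A D(2)\,\rho(2,0)^2}\; y\Big) \Big\rangle\!\!\Big\rangle . \tag{61}$$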

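Here we have used the recursion relation (25) for $r^\mu(2)$ and the fact that
$\{\xi_i^\mu, g_b(\hat{h}_i^\mu(0))\}_i$ and $\{\xi_i^\mu, \sigma_i(0)\}_i$ become a collection of uncorrelated variables
for $\mu \in \mathcal{P}\setminus\{1\}$. We then have to calculate the correlations $R(3,0)$, $R(3,1)$
and $R(3,2)$. From the definition (46), the local field (54) and the joint
distribution (58) one easily arrives at

$$R(3,0) = \Big\langle\!\!\Big\langle \sigma(0) \int \mathcal{D}w^{2,0}(x,y)\; g_b\!\left(\xi^1 m^1(2) + \alpha\chi(1)\left[g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, x\right) + \chi(0)\sigma(0)\right] + \sqrt{\alpha A D(2)}\, y\right) \Big\rangle\!\!\Big\rangle \tag{62}$$

$$R(3,1) = \Big\langle\!\!\Big\langle \int \mathcal{D}w^{2,0}(x,y)\; g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, x\right) g_b\!\left(\xi^1 m^1(2) + \alpha\chi(1)\left[g_b\!\left(\xi^1 m^1(0) + \sqrt{\alpha A D(0)}\, x\right) + \chi(0)\sigma(0)\right] + \sqrt{\alpha A D(2)}\, y\right) \Big\rangle\!\!\Big\rangle . \tag{63}$$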



Finding $R(3,2)$ is more tricky since, after rewriting the network
configurations $\{\sigma_i(t)\}$ at time $t = 1$ and $t = 2$ by means of the gain function (4), the
local fields at the three first time steps appear. So one has to calculate the
elements of the correlation matrix of these local fields in general defined by

$$C(t,\tilde{t}) = \sqrt{\alpha A D(t)}\,\sqrt{\alpha A D(\tilde{t})}\;\rho(t,\tilde{t}) . \tag{64}$$

The correlation coefficients $\rho(1,0)$ and $\rho(2,0)$ have been calculated already
before (recall eqs. (51) and (56)). The correlation coefficient $\rho(2,1)$ of $h(2)$
and $h(1)$ is found by using the recursion relation (29) as

$$\rho(2,1) = \frac{\alpha\left[R(2,1) + \chi(0) R(2,0) + \chi(1)\, A D(1)\right]}{\sqrt{\alpha A D(2)}\,\sqrt{\alpha A D(1)}} . \tag{65}$$
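The distribution function $\mathcal{D}w^{2,1}(x,y,z)$ of the three local fields equals

$$\mathcal{D}w^{2,1}(x,y,z) = \frac{dx\, dy\, dz}{(2\pi)^{3/2}\sqrt{\mathrm{Det}\, w^{2,1}}} \exp\!\left(-\frac{1}{2}\,(x\ y\ z)\,(w^{2,1})^{-1}\,(x\ y\ z)^T\right) \tag{66}$$

where

$$w^{2,1} = \begin{pmatrix} \alpha A D(0) & C(1,0) & C(2,0) \\ C(1,0) & \alpha A D(1) & C(2,1) \\ C(2,0) & C(2,1) & \alpha A D(2) \end{pmatrix} . \tag{67}$$

Finally, using all this information one gets for the correlation parameter

$$R(3,2) = \Big\langle\!\!\Big\langle \int \mathcal{D}w^{2,1}(x,y,z)\; g_b\!\left(\xi^1 m^1(1) + \alpha\chi(0)\sigma(0) + y\right) g_b\!\left(\xi^1 m^1(2) + \alpha\chi(1)\left[g_b\!\left(\xi^1 m^1(0) + x\right) + \chi(0)\sigma(0)\right] + z\right) \Big\rangle\!\!\Big\rangle . \tag{68}$$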



These results can be compared with those for extremely diluted systems.
If the dilution is symmetric (see refs. for the case Q = 2) feedback
loops over two time steps can exist, but the probability to have loops over a
longer time period equals zero. Therefore the $\sigma_i(0)$-term in (54) drops out.
Furthermore in the Q = 2 case the expression for the correlation coefficient
(]56|) simply reads p(2, 0) = R(2,0). If the dilution is asymmetric @, all 
feedback disappears and the local field is simply Gaussian distributed. 



5 Fixed-point equations 

A second type of results can be obtained by requiring through the
recursion relations (28), (31) and (32) that the local field becomes stationary. We
show that this leads to the same fixed-point equations as those found from
thermodynamics in [21].



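For the Q-Ising model at zero temperature one can show that

$$H(t) = -\sum_{i\in\Lambda}\left(\sum_{j\in\Lambda} J_{ij}\,\sigma_i(t)\,\tilde{\sigma}_j(t) - b\left(\sigma_i^2(t) + \tilde{\sigma}_i^2(t)\right)\right) \tag{69}$$

with $\tilde{\sigma}_i(t)$ chosen such that

$$\epsilon_i[\tilde{\sigma}_i(t)|\sigma_{\Lambda\setminus\{i\}}(t)] = \min_{s\in\mathcal{S}} \epsilon_i[s|\sigma_{\Lambda\setminus\{i\}}(t)] , \tag{70}$$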
is a Lyapunov function. For finite $N$, $H(t)$ is bounded from below implying
that $H(t+1) - H(t) = 0$ after finitely many time steps. This can be realized
for $\sigma_i(t+2) = \sigma_i(t)$ $\forall i \in \Lambda$. The proof is straightforward and completely
analogous to the argumentation used in [16],[17]. Both a fixed point and a
two-cycle satisfy this condition. As stated in the introduction we only study
fixed-points.
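The monotonic decrease of the Lyapunov function can be checked numerically on a small network. The sketch below is an illustration under explicit assumptions: it takes Q = 3 with states $\{-1, 0, +1\}$, Hebb couplings with $A = 2/3$, and it interprets the second configuration entering $H(t)$ as the one selected by the zero-temperature update, i.e. $\sigma(t+1)$. The function name and parameter defaults are hypothetical.

```python
import random


def lyapunov_trace(n=40, p=3, b=0.5, steps=12, seed=0):
    """Run zero-temperature parallel dynamics and record H(t) along the trajectory."""
    rng = random.Random(seed)
    states = (-1.0, 0.0, 1.0)
    A = 2.0 / 3.0
    patterns = [[rng.choice(states) for _ in range(n)] for _ in range(p)]
    # Hebb couplings: symmetric, with zero diagonal
    J = [[0.0 if i == j else sum(x[i] * x[j] for x in patterns) / (n * A)
          for j in range(n)] for i in range(n)]

    def field(sig):
        return [sum(J[i][j] * sig[j] for j in range(n)) for i in range(n)]

    def update(sig):
        # minimising the energy potential is the same as maximising h*s - b*s^2
        return [max(states, key=lambda s, h=h: h * s - b * s * s) for h in field(sig)]

    def H(sig, nxt):
        # uses the symmetry of J: sum_ij J_ij sig_i nxt_j = sum_j nxt_j h_j(sig)
        coupling = sum(h * t for h, t in zip(field(sig), nxt))
        return -coupling + b * sum(s * s + t * t for s, t in zip(sig, nxt))

    sig = [rng.choice(states) for _ in range(n)]
    trace = []
    for _ in range(steps):
        nxt = update(sig)
        trace.append(H(sig, nxt))
        sig = nxt
    return trace
```

Along any trajectory the recorded values never increase, and they become constant once the configuration repeats with period one (fixed point) or two (two-cycle).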

Since the evolution equations for the order parameters in the extremely 
diluted and layered Q-Ising models do not change their form as time pro- 
gresses, the fixed-point equations are obtained immediately by leaving out 
the time dependence (see @,@]). This still allows small fluctuations in the 
configurations 

Since in the fully connected model treated here the form of the evolution
equations for the order parameters does change through the explicit appearance
of the $\{\sigma_i(t)\}$, $t > 0$, we cannot use that procedure to obtain the fixed-point
equations. Instead we require that the distribution of the local field becomes
independent of time. This is a stronger condition because fluctuations in the
network configuration are no longer allowed. Consequently, the main overlap
and activity in the fixed point are found from the definitions (7), (8) and not
from leaving out the time dependence in the recursion relations (13) and (14).
The same line of reasoning is followed in, e.g., [24],[25].

We start by eliminating the time-dependence in the evolution equations
for the local field (29). This leads to

$$h_i = \xi_i^1 m^1 + \frac{1}{1-\chi}\,\mathcal{N}(0, \alpha a) + \frac{\alpha\chi}{1-\chi}\,\sigma_i \tag{71}$$

with $h_i \equiv \lim_{t\to\infty} h_i(t)$. This expression consists of two parts: a normally
distributed part $\tilde{h}_i = \mathcal{N}(\xi_i^1 m^1, \alpha a/(1-\chi)^2)$ and some discrete noise part. We
remark that this discrete noise coming from the correlation of the $\{\sigma_i(t)\}$ at
different time steps is inherent in the fully connected dynamics.
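The step from the recursion for the local field, eq. (29), to the stationary form (71) can be spelled out as follows. This is only a sketch under the stated stationarity assumption, with all time arguments dropped ($h_i(t+1) = h_i(t) = h_i$, $\chi(t) = \chi$, and so on):

```latex
\begin{align*}
h_i &= \xi_i^1 m^1 + \chi\,\bigl[\,h_i - \xi_i^1 m^1 + \alpha\sigma_i\,\bigr] + \mathcal{N}(0,\alpha a) \\
(1-\chi)\,\bigl(h_i - \xi_i^1 m^1\bigr) &= \alpha\chi\,\sigma_i + \mathcal{N}(0,\alpha a) \\
h_i &= \xi_i^1 m^1 + \frac{1}{1-\chi}\,\mathcal{N}(0,\alpha a) + \frac{\alpha\chi}{1-\chi}\,\sigma_i .
\end{align*}
```

Dividing the Gaussian term by $1-\chi$ rescales its variance to $\alpha a/(1-\chi)^2$, while the coefficient of $\sigma_i$ is $\alpha\chi/(1-\chi)$.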

Employing the expression eq. (71) in the updating rule one finds

$$\sigma_i = g_b(\tilde{h}_i + \alpha\eta\,\sigma_i), \qquad \eta = \chi/(1-\chi) . \tag{72}$$

This is a self-consistent equation in $\sigma_i$ which in general admits more than
one solution. This type of equation has been solved in the case of analog
neural networks with continuous time dynamics using a Maxwell construction
[24],[25]. Such a construction is standard in thermodynamics in order to
maximize the exponent of the integrand appearing in free energy calculations.
Here we use a similar geometrical construction to treat eq. (72).
Let $L$ be the straight line which connects the centers of the plateaus of
the gain function $g_b(\cdot)$. The equations for the functions $g_b(\cdot)$ and $L(\cdot)$ read

$$g_b : x \mapsto s_k \quad \text{if} \quad b(s_k + s_{k-1}) < x < b(s_k + s_{k+1}) \tag{73}$$

$$L : x \mapsto \frac{x}{2b} . \tag{74}$$

The condition on the r.h.s. of (73) is a condition on $x$. Using the definition
of $L(\cdot)$, one can transform this into a condition on the image of $L(\cdot)$,
$\mathcal{R} = \{y \in \mathbb{R} \mid \exists\, x \in \mathbb{R} : L(x) = y\}$, viz.

$$g_b : x \mapsto s_k \quad \text{if} \quad \frac{s_k + s_{k-1}}{2} < L(x) < \frac{s_k + s_{k+1}}{2} . \tag{75}$$

Consider the transformation $T : (x, y) \mapsto (x - \alpha\eta y, y)$

$$T(L) : x \mapsto \frac{x}{2\,(b - \frac{\alpha\eta}{2})} . \tag{76}$$

The function $T(g_b)(\cdot)$ is not bijective while $T$ is not one-to-one. To obtain
a unique solution for eq. (72) we modify the former function such that it
becomes a step function with the same step height as the one in $T(g_b)(\cdot)$ and
the width of the steps such that $T(L)$ connects the centers of the plateaus:

$$T_L(g_b)(x) = s_k \quad \text{if} \quad \left(b - \frac{\alpha\eta}{2}\right)(s_k + s_{k-1}) < x < \left(b - \frac{\alpha\eta}{2}\right)(s_k + s_{k+1}) \tag{77}$$

or, using (73),

$$T_L(g_b)(x) = g_{\tilde{b}}(x) \quad \text{with} \quad \tilde{b} = b - \frac{\alpha\eta}{2} . \tag{78}$$

This at first sight ad-hoc modification leads us to a unique solution of the
self-consistent equation (72). Indeed, from this modified transformation we
know that

$$g_b(\tilde{h} + \alpha\eta\,\sigma) = g_{\tilde{b}}\big(\tilde{h} + \alpha\eta\,\sigma - \alpha\eta\, g_b(\tilde{h} + \alpha\eta\,\sigma)\big) , \tag{79}$$

such that

$$\sigma_i = g_{\tilde{b}}(\tilde{h}_i) . \tag{80}$$
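The effect of the construction can be checked directly for Q = 3. In the sketch below (states $\{-1, 0, +1\}$, thresholds at $\pm b$, and a non-negative product $\alpha\eta$ are assumptions of the example; function names are hypothetical), the state selected by the shifted gain parameter $b - \alpha\eta/2$ always satisfies the self-consistent equation (72), while in a window of fields just below $b$ two different states satisfy it, which is the non-uniqueness the construction resolves.

```python
def gain3(x, b):
    """Q = 3 gain function with states {-1, 0, +1} and thresholds at +b and -b."""
    return 1.0 if x > b else (-1.0 if x < -b else 0.0)


def fixed_point_state(h_tilde, b, alpha, eta):
    """State selected by the construction: gain function with b - alpha*eta/2."""
    return gain3(h_tilde, b - 0.5 * alpha * eta)


def is_self_consistent(sigma, h_tilde, b, alpha, eta):
    """Check eq. (72): sigma = g_b(h_tilde + alpha*eta*sigma)."""
    return sigma == gain3(h_tilde + alpha * eta * sigma, b)
```

For example, with `b = 0.5`, `alpha = 0.03` and `eta = 2.0` the field `h_tilde = 0.45` admits both `sigma = 0.0` and `sigma = 1.0` as solutions of eq. (72); the construction picks `sigma = 0.0` because `0.45` lies below the shifted threshold `0.47`.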

At this point we remark that plugging this result into the local field
equation (71) tells us that the latter is the sum of two Gaussians with shifted
mean (see also 
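Using the definition of the main overlap and activity (7) and (8) in the
limit $N \to \infty$, one finds in the fixed point

$$m^1 = \frac{1}{A}\,\Big\langle\!\!\Big\langle \xi^1 \int \mathcal{D}z\; g_{\tilde{b}}\!\left(\xi^1 m^1 + \sqrt{\alpha A D}\, z\right) \Big\rangle\!\!\Big\rangle \tag{81}$$

$$a = \Big\langle\!\!\Big\langle \int \mathcal{D}z\; g_{\tilde{b}}^2\!\left(\xi^1 m^1 + \sqrt{\alpha A D}\, z\right) \Big\rangle\!\!\Big\rangle . \tag{82}$$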

From (||), (HD and (J27|) it is clear that 

$$D = \frac{a/A}{(1-\chi)^2} \tag{83}$$

with

$$\chi = \frac{1}{\sqrt{\alpha A D}}\,\Big\langle\!\!\Big\langle \int \mathcal{D}z\; z\, g_{\tilde{b}}\!\left(\xi^1 m^1 + \sqrt{\alpha A D}\, z\right) \Big\rangle\!\!\Big\rangle . \tag{84}$$

These resulting equations (81)-(83) are the same as the fixed-point equations
derived from a replica-symmetric mean-field theory treatment in [21]. Their
solution leads to the $\alpha$-$b$ phase diagram Fig. 1b in [21]. We end with
the observation that for analog networks the construction (73)-(78) is not
necessary: the fixed-point equation (72) has only one solution.
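To illustrate how the fixed-point equations can be solved in practice, the following sketch iterates them for the Q = 3 model with states $\{-1, 0, +1\}$ and uniform patterns ($A = 2/3$). Plain fixed-point iteration started at $m = 1$ is an assumption of this example rather than the method used in the paper, and it is not guaranteed to converge everywhere in the phase diagram. The Gaussian integrals are evaluated in closed form through the normal distribution function, using the shifted gain parameter $b - \alpha\eta/2$ with $\eta = \chi/(1-\chi)$. All names are hypothetical.

```python
import math


def Phi(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def phi(x):
    """Standard normal probability density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def fixed_point_q3(alpha, b, m=1.0, iters=200):
    """Iterate the fixed-point equations for m, a and chi (Q = 3, A = 2/3)."""
    A = 2.0 / 3.0
    a, chi = A, 0.0
    for _ in range(iters):
        if chi >= 1.0:
            raise ValueError("chi >= 1: the plain iteration left its valid range")
        D = (a / A) / (1.0 - chi) ** 2                   # variance of the residual overlap
        s = math.sqrt(alpha * A * D)                     # width of the Gaussian noise
        b_eff = b - 0.5 * alpha * chi / (1.0 - chi)      # shifted gain parameter
        m_new = a_new = chi_new = 0.0
        for xi in (-1.0, 0.0, 1.0):                      # uniform pattern values
            mu = xi * m
            up = Phi((mu - b_eff) / s)                   # probability of state +1
            down = Phi(-(mu + b_eff) / s)                # probability of state -1
            m_new += xi * (up - down) / (3.0 * A)
            a_new += (up + down) / 3.0
            chi_new += (phi((b_eff - mu) / s) + phi((b_eff + mu) / s)) / (3.0 * s)
        m, a, chi = m_new, a_new, chi_new
    return m, a, chi
```

For parameters deep in the retrieval region, such as $\alpha = 0.005$, $b = 0.3$, the iteration stays at an overlap very close to one with an activity very close to 2/3.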



6 Numerical simulations 

As an illustrative example the equations derived in Section 4 have been
worked out explicitly in the case of the Q = 3 model with equidistant states
and a uniform distribution of the patterns ($A = 2/3$).

For this model a thermodynamic replica-symmetric mean-field theory
approach leads to a capacity-gain phase diagram discussed already in [21]
(Fig. 1b). As explained in Section 5 the same phase diagram can be
obtained through the dynamical approach presented here. For convenience and
completeness this phase diagram is reproduced here as Fig. 1. At this point
it is also useful to recall that there are two types of retrieval states. In region
I the mean-square random overlap with the non-condensed patterns, $r$, is of
order $O(1)$ while in region II $r$ is of order $O(10)$ [21].

For specific network parameters corresponding to different points in the
retrieval region of this equilibrium phase diagram, indicated as 1 to 4, we have
compared the dynamics governed by the evolution equations found here with
extensive simulations involving system sizes up to $N = 6000$ (each data point
is averaged over 1600 runs).

Figures 2-5 present an overview of these results by plotting the overlap
$m^1(t)$, the activity $a(t)$ and the Hamming distance $d(t)$ versus the initial
overlap $m_0^1$ with the condensed pattern. (We drop the superscript 1 in what
follows.) The initial activity is taken to be $a_0 = 0.85$.

First we consider region I. For network parameters corresponding to point
1 below the thermodynamic transition line, i.e., $\alpha = 0.005$, $b = 0.3$, we see in
Fig. 2 that for $m_0 > 0.33$ the dynamics quickly evolves to an overlap $m = 1$
and that the Hamming distance is zero for $m_0 > 0.37$. The activity
attains the value 2/3, meaning that the network configuration is uniformly
distributed. The boundary between the $m = 1$ attractor and the zero-attractor
is rather sharply determined.

For a network corresponding to point 2 above the thermodynamic
transition line, with $\alpha = 0.03$, $b = 0.5$, we need a larger value of $m_0$ to reach the
$m = 1$ attractor and a Hamming distance zero. As seen in Fig. 3, $m_0$ has to
be at least 0.75. Also the boundary between the $m = 1$ attractor and the
zero-attractor is less sharply determined. Figure 4 shows that this behavior
is qualitatively the same for $\alpha = 0.009$, $b = 0.7$, corresponding to point 3
situated above the spin-glass transition in the phase diagram. In this case
the value of $m_0$ has to be at least 0.85. For the other network parameters we
have looked at, e.g., $\alpha = 0.0115$, $b = 0.5$, the global behavior is similar.

For network parameters corresponding to points in region II of the phase
diagram, e.g., point 4 with $\alpha = 0.015$, $b = 0.1$, it is shown in Fig. 5 that the
main overlap goes to its maximum value for almost all values of $m_0$. The
basin of attraction of the zero fixed-point is zero. The activity, however, goes
to a value larger than 2/3. The network configuration is no longer uniformly
distributed: the state $\sigma_i = 0$ has a smaller probability to appear than the
states $\sigma_i = \pm 1$. Hence, the Hamming distance is never zero. This must
be due to the fact that the influence of the non-condensed patterns is much
larger here ($r \approx O(10)$). The same qualitative behavior is found for network
parameters corresponding to points in region II below the thermodynamic
transition line, e.g., $\alpha = 0.005$, $b = 0.1$.
7 Concluding remarks

In this paper we have derived the evolution equation for the distribution 
of the local field governing the parallel dynamics at zero temperature of fully 
connected Q-Ising networks, taking into account all feedback correlations. 
This leads to a general recursive scheme which allows us to calculate the 
relevant order parameters of the system, i.e., the main overlap, the activity 
and the variance of the residual overlap, for any time step. We have worked 
out this scheme explicitly for the first three time steps of the dynamics. 

Under the condition that the local field becomes stationary we have also
obtained the fixed-point equations for these order parameters. They are
found to be the same as those derived via thermodynamic methods [21].



As an illustration we have presented a detailed discussion of these results 
for the Q = 3-model and we have made a comparison with extensive nu- 
merical simulations. It is seen that these numerical results provide excellent 
support for our theoretical predictions and that the first three time steps do 
give already a clear picture of the time evolution in the retrieval regime of 
the network. 
Figures



Figure 1: The $\alpha$-$b$ phase diagram (see [21], Fig. 1b).
Figure 2: A comparison of the theoretical results and numerical simulations
for systems with $N = 6000$. The overlap $m(t)$, the activity $a(t)$ and
the Hamming distance $d(t)$ are presented for the first three time steps as a
function of $m_0$ for the network parameters $b = 0.3$, $\alpha = 0.005$, $a_0 = 0.85$.
Theoretical (simulation) results for the first, second and third time step are
indicated by a short-dashed curve (diamond symbol), a long-dashed curve
(times symbol) and a full line (triangle symbol) respectively.
Figure 3: As in Fig. 2, for the network parameters $b = 0.5$, $\alpha = 0.03$, $a_0 = 0.85$.
Figure 4: As in Fig. 2, for the network parameters $b = 0.7$, $\alpha = 0.009$, $a_0 = 0.85$.




Figure 5: As in Fig. 2, for the network parameters $b = 0.1$, $\alpha = 0.015$, $a_0 = 0.85$.



